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after SIX (6) MONTHS from the mailing date of this communication. 

- If NO period for reply is specified above, the maximum statutory period will apply and will expire SIX (6) MONTHS from the mailing date of this communication. 
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DETAILED ACTION 



Election/Restrictions 

1. Restriction is required under 35 U.S.C. 121 and 372. 

This application contains the following inventions or groups of inventions which are not 
so linked as to form a single general inventive concept under PCT Rule 13.1. 

In accordance with 37 CFR 1.499, applicant is required, in reply to this action, to elect a 
single invention to which the claims must be restricted. 

Group I, claim(s) 1-4, 7-8, 12, and 16, drawn to polynucleotides, vectors and host cells 
comprising the same, and compositions and kits comprising the same. 

Group II, claim(s) 5-6, 14, and 19, drawn to polypeptides, and kits and compositions comprising 
the same. 

Group III, claim(s) 9, drawn to transgenic organisms. 

Group IV, claim(s) 10 and 18, drawn to antibodies, and compositions and kits comprising the 
same. 

Group V, claim(s) 1 1 and 15, drawn to a screening method utilizing a polynucleotide or cells 
comprising said polynucleotide. 

Group VI, claim(s) 13, drawn to a screening method utilizing a polypeptide. 
Group VII, claim(s) 17, drawn to a screening method utilizing an antibody. 

2. The inventions listed as Groups I- VII do not relate to a single general inventive concept 
under PCT Rule 13.1 because, under PCT Rule 13.2, they lack the same or corresponding special 
technical features for the following reasons: Group I is directed to a DNA comprising a base 
sequence encoding a polypeptide comprising the full length or a part of an amino acid sequence 
which is the same or substantially the same as an amino acid sequence represented by SEQ ID 
NO:l. However, since van der Zwaag et al. (Dev. Dyn. 225:336-343, 2002) teach a cDNA 
(accession no. AY1 16661) that encodes a PLEXIN-D1 polypeptide (See Fig. 1) that shares 92% 
sequence identity to SEQ ID NO:l (See attached sequence alignment), and thus encodes part of 
an amino acid sequence which is the same or substantially the same as SEQ ID NO:l, no special 
technical feature exists for group I as defined by PCT Rule 13.2, because it does not define a 
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contribution over the prior art. Because the technical feature of Group I is not a special technical 
feature, and because the technical features of the Groups II-VII inventions is not present in the 
Group I claims, unity of invention is lacking. Furthermore, the polynucleotides of Group I, the 
polypeptides of Group II, the transgenic organisms of Group III, and the antibodies of Group IV 
are structurally and functionally different chemical compounds, having different structures and 
activities, or in the case of the transgenic animals an organism, and each of which can be made 
and used without the other compounds. The methods of Groups V, VI, and VII require 
compounds which are functionally different from each other and each can be made and used 
without the other. Lack of unity is shown because these compounds lack a common utility 
which is based upon a common structural feature which has been identified as the basis for that 
common utility. 

Further Restriction Within Groups I- VII 

3. Whichever Group is elected, further restriction within the elected Group is required to 
one of the following: 

Applicants must further elect one polypeptide and the corresponding nucleic acid 
that encodes said polypeptide selected from SEQ ID NO: (encoded by SEQ ID 
NO:2), SEQ ID NO: 15 (encoded by SEQ ID NO: 16), or SEQ ID NO: 18 (encoded 
by SEQ ID NO: 19) 

4. The polypeptides and polynucleotide molecules do not relate to a single inventive 
concept under PCT Rule 13.1 because, under PCT Rule 13.2, they lack the same or 
corresponding special technical features for the following reasons: Each polypeptide and 
polynucleotide molecule represents a structurally and functionally different chemical compound 
from each other, having different chromosomal locations and sequences for the nucleic acids, 
and having different amino acid sequences, structures and activities for the polypeptides, each of 
which can be made and used without the other compounds. Accordingly, the methods of using 
the compounds are also, therefore, different methods. Lack of unity is shown because these 
compounds and methods lack a common utility which is based upon a common structural feature 
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which has been identified as the basis for that common utility. 

5. Applicants are advised that this is not a species election. 

6. The examiner has required restriction between product and process claims. Where 
applicant elects claims directed to the product, and the product claims are subsequently found 
allowable, withdrawn process claims that depend from or otherwise require all the limitations of 
the allowable product claim will be considered for rejoinder. Ah claims directed to a nonelected 
process invention must require all the limitations of an allowable product claim for that process 
invention to be rejoined. 

7. In the event of rejoinder, the requirement for restriction between the product claims and 
the rejoined process claims will be withdrawn, and the rejoined process claims will be fully 
examined for patentability in accordance with 37 CFR 1.104. Thus, to be allowable, the rejoined 
claims must meet all criteria for patentability including the requirements of 35 U.S.C. 101, 102, 
103 and 112. Until all claims to the elected product are found allowable, an otherwise proper 
restriction requirement between product claims and process claims may be maintained. 
Withdrawn process claims that are not commensurate in scope with an allowable product claim 
will not be rejoined. See MPEP § 821 .04(b). Additionally, in order to retain the right to rejoinder 
in accordance with the above policy, applicant is advised that the process claims should be 
amended during prosecution to require the limitations of the product claims. Failure to do so 
may result in a loss of the right to rejoinder. Further, note that the prohibition against double 
patenting rejections of 35 U.S.C. 121 does not apply where the restriction requirement is 
withdrawn by the examiner before the patent issues. See MPEP § 804.01. 
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8. Applicant is advised that the reply to this requirement to be complete must include 
(i) an election of a invention to be examined even though the requirement may be traversed (37 
CFR 1.143) and (ii) identification of the claims encompassing the elected invention. 

9. The election of an invention may be made with or without traverse. To reserve a right to 
petition, the election must be made with traverse. If the reply does not distinctly and specifically 
point out supposed errors in the restriction requirement, the election shall be treated as an 
election without traverse. Traversal must be presented at the time of election in order to be 
considered timely. Failure to timely traverse the requirement will result in the loss of right to 
petition under 37 CFR 1.144. If claims are added after the election, applicant must indicate 
which of these claims are readable on the elected invention. 

10. If claims are added after the election, applicant must indicate which of these claims are 
readable upon the elected invention. 

1 1 . Should applicant traverse on the ground that the inventions are not patentably distinct, 
applicant should submit evidence or identify such evidence now of record showing the 
inventions to be obvious variants or clearly admit on the record that this is the case. In either 
instance, if the examiner finds one of the inventions unpatentable over the prior art, the evidence 
or admission may be used in a rejection under 35 U.S.C. 103(a) of the other invention. 

12. Applicant is reminded that upon the cancellation of claims to a non-elected invention, the 
inventorship must be amended in compliance with 37 CFR 1.48(b) if one or more of the 
currently named inventors is no longer an inventor of at least one claim remaining in the 
application. Any amendment of inventorship must be accompanied by a request under 37 CFR 
1 .48(b) and by the fee required under 37 CFR 1 . 1 7(i). 
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Advisory Information 

Effective November 1, 2007, if applicant wishes to present more than 5 independent claims or 
more than 25 total claims in an application, applicant will be required to file an examination 
support document (ESD) in compliance with 37 CFR 1.265 before the first Office action on the 
merits (hereafter "5/25 claim threshold"). See Changes to Practice for Continued Examination 
Filings, Patent Applications Containing Patentably Indistinct Claims, and Examination of Claims 
in Patent Applications , 72 Fed. Reg. 46715 (Aug. 21, 2007), 1322 Off. Gaz. Pat. Office 76 (Sept. 
11, 2007) (final rule). The changes to 37 CFR 1.75(b) apply to any pending applications in 
which a first Office action on the merits (FAOM) has not been mailed before November 1, 2007. 
Withdrawn claims will not be taken into account in determining whether an application exceeds 
the 5/25 claim threshold. For more information on the final rule, please see 
http://www.uspto.gov/web/officesfo 

In response to the restriction requirement set forth in this Office action, applicant is required to 
file an election responsive to the restriction requirement. Applicant may not file a suggested 
restriction requirement (SRR) in lieu of an election responsive to the restriction requirement as a 
reply. A SRR alone will not be considered a bona-fide reply to this Office action. 

If applicant elects an invention that is drawn to no more than 5 independent claims and no more 
than 25 total claims, applicant will not be required to file an ESD in compliance with 37 CFR 
1 .265 that covers each of the elected claims. If the elected invention is drawn to more than 5 
independent claims or more than 25 total claims, applicant may file an amendment canceling a 
number of elected claims so that the elected invention would be drawn to no more than 5 
independent claims and no more than 25 total claims. 

If the restriction requirement is mailed on or after November 1, 2007, applicant is also required 
to file an ESD in compliance with 37 CFR 1.265 that covers each of the elected claims, unless 
the elected invention is drawn to no more than 5 independent claims and no more than 25 total 
claims taking into account any amendment to the claims. To avoid the abandonment of the 
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application, the ESD (if required) and the election must be filed within TWO MONTHS from 
the mailing date of this Office action. The two-month time period for reply is extendable under 
37 CFR 1.136. 



If the restriction requirement is mailed before November 1, 2007, the election must be filed 
within ONE MONTH or THIRTY DAYS, whichever is longer, from the mailing date of this 
Office action. The time period for reply is extendable under 37 CFR 1.136. Furthermore, if the 
elected invention is drawn to more than 5 independent claims or more than 25 total claims taking 
into account any amendment to the claims, the Office will notify applicant and provide a time 
period in which applicant is required to file an ESD in compliance with 37 CFR 1.265 covering 
each of the elected claims or amend the application to contain no more than 5 independent 
elected claims and no more than 25 total elected claims. 
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Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Jon M. Lockard whose telephone number is (571) 272-2717. 
The examiner can normally be reached on Monday through Friday, 7:00 AM to 4:30 PM. If 
attempts to reach the examiner by telephone are unsuccessful, the examiner's supervisor, 
Manjunath N. Rao, can be reached on (571) 272-0939. 

The fax number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or Public PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private 
PAIR system, contact the Electronic Business Center (EBC) at 866-217-9197 (toll-free). 




Jon M. Lockard, Ph.D. 
October 18,2007 



RT and mouse cDNA sequences. "; 

RL Proc. Natl. Acad. Sci. U.S.A. 99:168 99-16903(2002). 

RN [4] 

RP GLYCOSYLATION [LARGE SCALE ANALYSIS] AT ASN-500, AND MASS 

RP SPECTROMETRY. 

RX PubMed=16335952; DOI=10 . 1021/pr0502065; 

RA Liu T., Qian W.-J., Gritsenko M.A., Camp D.G. II, Monroe M.E., 

RA Moore R.J., Smith R.D.; 

RT "Human plasma N-glycoproteome analysis by immunoaf f inity subtraction, 

RT hydrazide chemistry, and mass spectrometry."; 

RL J. Proteome Res. 4:2070-2080(2005). 

CC -!- ALTERNATIVE PRODUCTS: 

CC Event=Alternative splicing; Named isoforms=2; 

CC Name=l; 

CC IsoId=Q9Y4D7-l; Sequence=Displayed; 

CC Name=2; 

CC IsoId=Q9Y4D7-2; Sequence=VSP_011516 ; 

CC -!- TISSUE SPECIFICITY: Detected at low levels in heart, placenta, 
CC lung, skeletal muscle, kidney, thymus and liver. Detected at very 

CC low levels in brain, colon, spleen, small intestine and peripheral 

CC blood leukocytes. 

CC -!- SIMILARITY: Belongs to the plexin family. 

CC -!- SIMILARITY: Contains 3 IPT/TIG domains. 

CC -!- SIMILARITY: Contains 1 Sema domain. 

CC 

CC Copyrighted by the UniProt Consortium, see http://www.uniprot.org/terms 

CC Distributed under the Creative Commons Attribut ion-NoDerivs License 

CC 

DR EMBL; AB014520; BAA31595.1; ALT_INIT; mRNA. 

DR EMBL; AY116661; AAM49063.1; -; mRNA. 

DR EMBL; BC003526; AAH03526.1; -; mRNA. 

DR EMBL; BC011848; AAH11848.1; -; mRNA. 

DR UniGene; Hs. 301685; -. 

DR Ensembl; ENSG00000004 399 ; Homo sapiens. 

DR HGNC; HGNC:9107; PLXND1 . 

DR MIM; 604282; gene. 

DR InterPro; IPR002909; IPT_TIG_rcpt . 

DR InterPro; IPR003659; Plexin-like. 

DR InterPro; IPR013548; Plexin_cytopl . 

DR InterPro; IPR002165; Plexin_repeat . 

DR InterPro; IPR008936; Rho_GAP. 

DR InterPro; IPR001627; Sema. 

DR Pfam; PF08337; Plexin_cytopl ; 1. 

DR Pfam; PF01437; PSI; 2. 

DR Pfam; PF01403; Sema; 1. 

DR Pfam; PF01833; TIG; 3. 

DR SMART; SM00429; IPT; 3. 

DR SMART; SM00423; PSI; 3. 

DR SMART; SM0 0630; Sema; 1. 

DR PROSITE; PS51004; SEMA; 1. 

KW Alternative splicing; Glycoprotein; Membrane; Polymorphism; Receptor; 

KW Repeat; Signal; Transmembrane. 

FT SIGNAL 1 4 6 Potential. 

FT CHAIN 47 1925 Plexin-Dl . 

FT /FTId=PRO_0000024676 . 

FT TOPO_DOM 47 1271 Extracellular (Potential) . 

FT TRANSMEM 1272 1292 Potential. 



RESULT 4 
PLXD1_HUMAN 

ID PLXD1_HUMAN STANDARD; PRT; 1925 AA. 

AC Q9Y4D7; Q6PJS9; Q8IZJ2; Q9BTQ2; 

DT 31-AUG-2004, integrated into UniProtKB/Swiss-Prot . 

DT 31-AUG-2004, sequence version 2. 

DT 25-JUL-2006, entry version 36. 

DE Plexin-Dl precursor. 

GN Name=PLXNDl; Synonyms=KIAA0620 ; 

OS Homo sapiens (Human) . 

OC Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 

OC Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 

OC Catarrhini; Hominidae; Homo. 

OX NCBI_TaxID=9606; 

RN [1] 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] (ISOFORM 1). 

RC TISSUE=Brain; 

RX MEDLINE-98403880; PubMed=97 34 8 11 ; DOI-10 . 10 93/dnares /5 . 3 . 169; 

RA Ishikawa K. , Nagase T., Suyama M., Miyajima N., Tanaka A., Kotani H., 

RA Nomura N . , Ohara O. ; 

RT "Prediction of the coding sequences of unidentified human genes. X. 

RT The complete sequences of 100 new cDNA clones from brain which can 

RT code for large proteins in vitro."; 

RL DNA Res. 5:169-176(1998). 

RN [2] 

RP NUCLEOTIDE SEQUENCE [MRNA] (ISOFORM 1), AND TISSUE SPECIFICITY. 

RX MEDLINE=22299888; PubMed=124 12018 ; DOI=10 . 1002/dvdy. 10159; 

RA van der Zwaag B . , Hellemons A.J.C.G.M., Leenders W.P.J. , 

RA Burbach J.P.H., Brunner H.G., Padberg G.W., Van Bokhoven H.; 

RT "PLEXIN-D1, a novel plexin family member, is expressed in vascular 

RT endothelium and the central nervous system during mouse 

RT embryogenesis . " ; 

RL Dev. Dyn. 225:336-34 3(2002). 

RN [3] '~' ~ 

RP NUCLEOTIDE SEQUENCE [LARGE SCALE MRNA] OF 1386-1925 (ISOFORMS 1 AND 

RP 2) . 

RC TISSUE=Muscle, and Uterus; 

RX MEDLINE=22388257; PubMed=12 4 7 7 932 ; DOI = 10 ..1073/pnas . 242603899; 

RA Strausberg R.L., Feingold E.A., Grouse L.H., Derge J.G., 

RA Klausner R.D., Collins F.S., Wagner L . , Shenmen CM., Schuler G.D., 

RA Altschul S.F., Zeeberg B., Buetow K.H., Schaefer C.F., Bhat N.K., 

RA Hopkins R.F., Jordan H., Moore T., Max S.I., Wang J., Hsieh F., 

RA Diatchenko L. , Marusina K., Farmer A. A., Rubin G.M., Hong L., 

RA Stapleton M. , Soares M.B., Bonaldo M.F., Casavant T.L., Scheetz T.E., 

RA Brownstein M.J., Usdin T.B., Toshiyuki S., Carninci P., Prange C, 

RA Raha S.S., Loquellano N.A., Peters G.J., Abramson R.D., Mullahy S.J., 

RA Bosak S.A., McEwan P.J., McKernan K.J., Malek J. A., Gunaratne P.H., 

RA Richards S., Worley K.C., Hale S., Garcia A.M., Gay L.J., Hulyk S.W., 

RA Villalon D.K., Muzny D.M., Sodergren E.J., Lu X., Gibbs R.A., 

RA Fahey J., Helton E . , Ketteman M., Madan A., Rodrigues S., Sanchez A., 

RA Whiting M., Madan A., Young A.C., Shevchenko Y., Bouffard G.G., 

RA Blakesley R.W., Touchman J.W., Green E.D., Dickson M.C., 

RA Rodriguez A.C., Grimwood J., Schmutz J., Myers R.M., 

RA Butterfield Y.S.N. , Krzywinski M.I., Skalska U., Smailus D.E., 

RA Schnerch A., Schein J.E., Jones S.J.M., Marra M . A . ; 

RT "Generation and initial analysis of more than 15,000 full-length human 



FT 


TOPO DOM 


1293 


1925 


Cytoplasmic (Potential) . 


FT 


DOMAIN 


47 


546 


Sema . 






FT 


DOMAIN 


891 


979 


IPT/TIG 1. 






FT 


DOMAIN 


981 


1066 


IPT/TIG 2. 






FT 


DOMAIN 


1069 


1160 


IPT/TIG 3. 






FT 


CARBOHYD 


86 


86 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


155 


155 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


188 


188 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


224 


224 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


481 


481 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


500 


500 


N-linked 


GlcNAc. . . 




FT 


CARBOHYD 


583 


583 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


696 


696 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


736 


736 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


802 


802 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


965 


965 


N-linked 


v GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1017 


1017 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1060 


1060 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1099 


1099 


N-linked 


,GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1118 


1118 


N-linked 


[GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1132 


1132 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1237 


1237 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


CARBOHYD 


1257 


1257 


N-linked 


GlcNAc. . . 


) (Potential) . 


FT 


VAR_SEQ 


1766 


1925 


SLPLRFWVNILKNPQFVFDIDKTDHIDACLSVIAQAFIDAC 


FT 








SISDLQLGKDSPTNKLLYAKEIPEYRKIVQRYYKQIQDMTP 


FT 








LSEQEMNAHLAEESRKYQNEFNTNVAMAEI YKYAKRYRPQI 


FT 








MAALEANPTARRTQLQHKFEQVVALMEDNI YECYSEA -> 


FT 








RWRPSSPVLGEHPEEPPVCL 


( in isof orm 2 ) . 


FT 








/FTId=VSP 


011516. 




FT 


VARIANT 


870 


870 


M -> V (in dbSNP: 2255703) . 


FT 








/FTId=VAR 


022144 . 




SQ 


SEQUENCE 


1925 


AA; 212095 


MW; 2 6001F5D0B2A8 0E5 


CRC64; 



Query Match 92.2%; Score 8468; DB 1; Length 1925; 

Best Local Similarity 91.9%; Pred. No. 0; 

Matches 1606; Conservative 52; Mismatches 88; Indels 2; Gaps 1; 

Qy 1 SMLNVAANHPNASTVGLVLPPTSGTGGSRLLVGATYTGFGSAFFPRNRSLEDHRFENTPE 60 

I I I I I I I I I I I I I I I I I I I M : I I I I I I I I I I I I I I : I I : I I I I I I I I I I I I I I I I I I 
Db 178 SMLNVAANHPNASTVGLVLPPAAGAGGSRLLVGATYTGYGSSFFPRNRSLEDHRFENTPE 237 

Qy 61 IAIRSLDARGDLAKLFTFDLNPSDDNILKIKQGAKEQHKLGFVRAFLHPAVPPHSAQPYA 12 0 

I I I I I I I I I I I I I I II I I I I I I I I I I I I i I I I I I I I I 1 I I I I I I I I I : I I M M 
Db 238 IAIRSLDTRGDLAKLFTFDLNPSDDNILKIKQGAKEQHKLGFVSAFLHPSDPPPGAQSYA 297 



Qy 121 YLALNSEARAGDKDSQARSLLARICLPRGAGGDAKKLTESYIQLGLQCAGGAGRGDLYSR 180 

I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 2 98 YLALNSEARAGDKESQARSLLARICLPHGAGGDAKKLTESYIQLGLQCAGGAGRGDLYSR 357 

Qy 181 LVSVFPAREQFFAVFERPQGAPGARNAPAALCAFRFDDVQAAIRAARTACFVEPAPDVVA 240 

I I I I I I I I I : 111111111:1 II I I I I II I I II I I : I I I I I I I I I I I II I I I I I I I 

Db 358 LVSVFPARERLFAVFERPQGSPAARAAPAALCAFRFADVRAAIRAARTACFVEPAPDVVA 417 



Qy 

Db 



241 VLDSVVQGTGPACESKRNIQLQPEQLDCGAAHLQHPLTILQPLRASPVFRAPGLTAVAVA 300 

I I I I I I I I I I I I I I I I I I I I M I I I I I I I I I I I I I : I I I I I : I : I I I I I I I i I : M I I 
418 VLDSVVQGTGPACERKLNIQLQPEQLDCGAAHLQHPLSILQPLKATPVFRAPGLTSVAVA 47 7 



Qy 


301 


Db 


478 


Qy 


361 


Db 


538 


Qy 


421 


Db 


598 


Qy 


481 


Db 


658 


Qy 


541 


Db 


718 


Qy 


601 


Db 


778 


Qy 


661 


Db 


838 


Qy 


721 


Db 


898 


Qy 


781 


Db 


958 


Qy 


841 


Db 


1018 


Qy 


901 


Db 


1078 


Qy 


961 


Db 


1138 


Qy 


1019 


nh 


1198 


Qy 


1079 


Db 


1258 


Qy 


1139 



SANNYTAVFLGTATGRLLKISLNESMQVVSRRVLTVAYGEPVHHVMQFDPMDPGYLYLMT 360 

I I I I I I I I I I I I I I I I I : I I I II I I I I I I I : I I I I I I I I I I I I I I I I I MUM 
SVNNYTAVFLGTVNGRLLKINLNESMQVVSRRVVTVAYGEPVHHVMQFDPADSVYLYLMT 537 

SHQMARVKVAACEVHSTCGDCVGAADAYCGWCTLETRCTLQQDCTNSSQPHFWTSASEGP 4 20 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I 
SHQMARVKVAACNVHSTCGDCVGAADAYCGWCALETRCTLQQDCTNSSQQHFWTSASEGP 5 97 

SRCPAMTVLPSEIDVHRDYTGMILQISGSLPSLSGMEMACDYGNGVRTVARVPGPAYDHQ 4 80 

I I I I I I I II I I I I I I : : I I I I I I I I I I I I II I I I I I I I I I II : I I I I I I I I I I : II 
SRCPAMTVLPSEIDVRQEYPGMILQISGSLPSLSGMEMACDYGNNIRTVARVPGPAFGHQ 657 

IAYCNLLPRAQFPSFPAGQDHVTVEMSVRVKGHNIVSANFTI YDCSRIGQVYPHTACTSC 54 0 

I I I I I II I I I I I II I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I II I I I 
IAYCNLLPRDQFPPFPPNQDHVTVEMSVRVNGRNIVKANFTIYDCSRTAQVYPHTACTSC 717 

LSTQWPCSWCIQLHSCVSNQSQCQDSPNPTSPQDCPQILPSPLAPVPTGGSQDILVPLTK 600 

111111 II I 11111111:1: I I I I I II I I I I : I I I I I I I I I I I I I : I I I I I 
LSAQWPCFWCSQQHSCVSNQSRCEASPNPTSPQDCPRTLLSPLAPVPTGGSQNILVPLAN 77 7 

ATFFHGSSLECSFGLEESFEAVWANNSLVRCNQVVLHTTQKSQVFPLSLKLKGPPDRFLD 660 

II I :: I I I I I I I I I I I I I I I I : I I I : I I I I I II : I I I I I I I I I : I I I I I I I I 
TAFFQGAALECSFGLEEIFEAVWVNESVVRCDQVVLHTTRKSQVFPLSLQLKGRPARFLD 837 

SPNPMTVVVYNCAMGSPDCSQCLGREDLGHLCVWNDGCRLRGPLQPLPGTCPAPEIRAIE 7 20 

II I I I I : I II I I I I I I I I I I I I I I II I I I I I : I : I I I I I I II I I I : I I I I I I I I I I I I 
SPEPMTVMVYNCAMGSPDCSQCLGREDLGHLCMWSDGCRLRGPLQPMAGTCPAPEIRAIE 8 97 

PLSGPLDGGTLLTIRGRNLGRRLSDVAHGVWIGSVACEPLADRYTVSEEIVCATGPAAGA 780 

I I I I I I I I I I I I I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I I 
PLSGPLDGGTLLTIRGRNLGRRLSDVAHGVWIGGVACEPLPDRYTVSEEIVCVTGPAPGP 957 

FSDVVTVNVSKEGRSREQFSYVLPTVHSLEPSMGPKAGGTRITIHGSDLNVGSMLQVLVN 84 0 

I I I I I I I I I I : I I : : I I I I I I I I I I I I : I I I I II I I I I I I I I : I I : I I I MINI 
LSGVVTVNASKEGKSRDRFSYVLPLVHSLEPTMGPKAGGTRITIHGNDLHVGSELQVLVN 1017 

DTDPCTDLTRTATS1TCTVPGGTLPSPVPVCVRFESRGCVHGNLTFWYMQNPVITAISPG 900 
111111:1 M III 11:1 I I I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II 
DTDPCTELMRTDTSIACTMPEGALPAPVPVCVRFERRGCVHGNLTFWYMQNPVITAISPR 1077 

RSPVSGGRTITVAGERFHMVQNVSMAVHHIGREPTFCKVLNSTLITCPSPGALSNASAPV 960 

J I I I I I I I I II I I I I I I I I I I I I I I I I I I I II I I I I I I II I I I I I I I I I I I I I I I I I I I 
RSPVSGGRTITVAGERFHMVQNVSMAVHHIGREPTLCKVLNSTLITCPSPGALSNASAPV 1137 

DFFINGRAYADE--/y\EELLDPAEAQRGSRFRLDYLPNPQFSTAKREKWIKHHPGEPLTL 1018 

MINIMI MINN I II I I I I I N I I I N I I I N I II I I I I I I I I I I I I I I 

DFFINGRAYADEVAVAEELLDPEEAQRGSRFRLDYLPNPQFSTAKREKWIKHHPGEPLTL 1197 

VIHKEQDSLGLESHEYHIKIGQVSCDIQI ISDRVIHCSVNESLGTAEGQLPITIQVGNFN 107 8 

I I I I I I I I I II : I I I I : I I I M II II I I : I I I : II I I I II I I I I I I I I II II II II I 
VIHKEQDSLGLQSHEYRVKIGQVSCDIQIVSDRIIHCSVNESLGAAVGQLPITIQVGNFN 1257 

QTIATLQLGGSETAIVVSIVICSVLLLLSVVALFVFCTKSRRAERYWQKTLLQMEEMESQ 1138 

II I I I I I II II I I I I : II II I II I I I II I I I N II I N I I I I I I II I I II II I I I I I I I I 
QTIATLQLGGSETAI IVSIVICSVLLLLSVVALFVFCTKSRRAERYWQKTLLQMEEMESQ 1317 

IREEIRKGFAELQTDMTDLTKELNRSQGIPFLEYKHFVTRTFFPKCSSLYEERYVLPSKT 1198 



Db 1318 IREEIRKGFAELQTDMTDLTKELNRSQGIPFLEYKHFVTRTFFPKCSSLYEERYVLPSQT 1377 

Qy 1199 LNSQGGSPPQETHPLLGEWNIPEHCRPSMEEGISLFSSLLNNKHFLIVFVHALEQQKDFA 1258 

Mill I I I I I I I I I I I I I I I M : I I I I I II I I I I I I I I M I II I I I I I I I I I I II 
Db 1378 LNSQGSSQAQETHPLLGEWKIPESCRPNMEEGISLFSSLLNNKHFLIVFVHALEQQKDFA 1437 

Qy 1259 VRDRCSLASLLTIALHGKLEYYTSIMKELLVDLIDASAAKNPKLMLRRTESVVEKMLTNW 1318 

I I I I I M I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 14 38 VRDRCSLASLLTIALHGKLEYYTSIMKELLVDLI DASAAKNPKLMLRRTESVVEKMLTNW 14 97 

Qy 1319 MSICMYGCLRETVGEPFFLLLCAIKQQINKGSIDAITGKARYTLNEEWLLRENIEAKPRN 1378 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 
Db 14 98 MSICMYSCLRETVGEPFFLLLCAIKQQINKGSIDAITGKARYTLNEEWLLRENIEAKPRN 1557 

Qy 1379 LNVSFQGCGMDSLSVRAMDTDTLTQVKEKILEAFCKNVPYSQWPRAEDVDLEWFASSTQS 1438 

I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I 
Db 1558 LNVSFQGCGMDSLSVRAMDTDTLTQVKEKILEAFCKNVPYSQWPRAEDVDLEWFASSTQS 1617 

Qy 14 39 YVLRDLDDTSVVEDGRKKLNTLAHYKIPEGASLAMSLTDKKDSTLGRVKDLDTEKYFHLV 14 98 

I : I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I : I I I I I I I I I I I I I I I I I 
Db 1618 YILRDLDDTSVVEDGRKKLNTLAHYKIPEGASLAMSLIDKKDNTLGRVKDLDTEKYFHLV 1677 

Qy 14 99 LPTDELVEPKKSHRQSHRKKVLPEI YLTRLLSTKGTLQKFLDDLFKAILSIREDKPPLAV 1558 

I I I I I I I I I I I II I I I I I I I I I I I M I I I I I I I I I I M I I I I I I I I I M I I I I I I I I I I ■ 
Db 1678 LPTDELAEPKKSHRQSHRKKVLPEI YLTRLLSTKGTLQKFLDDLFKAILSIREDKPPLAV 1737 

Qy 1559 KYFFDFLEEQAEKRGISDPDTLHIWKTNSLPLRFWVNILKNPQFVFDIEKTDHIDACLSV 1618 

I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I II : I I I I I I I I I I I 
Db 1738 KYFFDFLEEQAEKRGISDPDTLHIWKTNSLPLRFWVNILKNPQFVFDIDKTDHIDACLSV 1797 

Qy 1619 IAQAFIDACSISDLQLGKDSPTNKLLYAKEIPEYRKTVQRYYKQIQDMTPLSEQEMNAHL 1678 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M I I I I 
Db 17 98 IAQAFIDACSISDLQLGKDSPTNKLLYAKEIPEYRKIVQRYYKQIQDMTPLSEQEMNAHL 1857 

Qy 1679 AEESRKYQNEFNTNVAMAEI YKYAKRYRPQIMAALEANPTARRTQLQHKFEQVVALMENN 1738 

I I I I I I I I II I I I I I II I I I I I I I I I I I I I I M I I I I I I I II I I I I I M I I I I I I I I I : I 
Db 1858 AEESRKYQNEFNTNVAMAEI YKYAKRYRPQIMAALEANPTARRTQLQHKFEQVVALMEDN 1917 

Qy 1739 IYECYSEA 1746 

I I I I I I I I 
Db 1918 IYECYSEA 1925 



RESULT 6 
AY116661 

LOCUS AY116661 7095 bp mRNA linear PRI 14-NOV-2002 

DEFINITION Homo sapiens plexin Dl (PLXND1) mRNA, complete cds . 
ACCESSION AY116661 

VERSION AY116661.1 GI:24953986 

KEYWORDS 

SOURCE Homo sapiens (human) 

ORGANISM Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi ; 
Mammalia; Eutheria; Euarchontoglires ; Primates; Haplorrhini; 
Catarrhini; Hominidae; Homo. 
REFERENCE 1 (bases 1 to 7095) 

AUTHORS Van Per Zwaaq, B . , He 1 lemons , A. J. , Leenders, W. P. , Burbach, J . P . , 

Brunner,H.G., Padberg,G.W. and Van Bokhoven,H. 
TITLE PLEXIN-D1, a novel plexin family member, is expressed in vascular 

endothelium and the central nervous system during mouse 
embryogenesis 
JOURNAL Dev. Dyn. 225 (3), 336-343 (2002) 
PUBMED 12412018 — 
REFERENCE 2 (bases 1 to 7095) 

AUTHORS van der Zwaag,B. and van Bokhoven,H. 
TITLE Direct Submission 

JOURNAL Submitted ( 31-MAY-2002 ) Neurology, UMC Nijmegen, Reinier Postlaan 
4, Nijmegen, Gelderland 6525 GC, The Netherlands 
FEATURES Location/Qualifiers 
source 1. .7095 

/organism="Homo sapiens" 
/ mo 1 _t y p e = " mRN A " 
/db_xref="taxon: 9606" 
/chromosome="3" 
/map="3q21" 
gene 1. .7095 

/gene="PLXNDl" 
CDS 181. .5958 

/gene-"PLXNDl" 

/note="plexin family member" 
/codon_start=l 
/product="plexin Dl" 
/protein_id="AAM49063. 1" 
/db_xref="GI: 24953987" 

/translat ion="MAPRAAGGAPLSARAAAASPPPFQTPPRCPVPLLLLLLLGAARA 
GALEIQRRFPSPTPTNNFALDGAAGTVYLAAVNRLYQLSGANLSLEAEAAVGPVPDSP 
LCHAPQLPQASCEHPRRLTDNYNKILQLDPGQGLVVVCGSIYQGFCQLRRRGNISAVA 
VRFPPAAPPAEPVTVFPSMLNVAANHPNASTVGLVLPPAAGAGGSRLLVGATYTGYGS 
SFFPRNRSLEDHRFENTPEIAIRSLDTRGDLAKLFTFDLNPSDDNILKIKQGAKEQHK 
LGFVSAFLHPSDPPPGAQSYAYLALNSEARAGDKESQARSLLARICLPHGAGGDAKKL 
TESYIQLGLQCAGGAGRGDLYSRLVSVFPARERLFAVFERPQGSPAARAAPAALCAFR 
FADVRAAIRAARTACFVEPAPDVVAVLDSVVQGTGPACERKLNIQLQPEQLDCGAAHL 
QHPLSILQPLKATPVFRAPGLTSVAVASVNNYTAVFLGTVNGRLLKINLNESMQVVSR 
RVVTVAYGEPVHHVMQFDPADSVYLYLMTSHQMARVKVAACNVHSTCGDCVGAADAYC 
GWCALETRCTLQQDCTNSSQQHFWTSASEGPSRCPAMTVLPSEIDVRQEYPGMILQIS 
GSLPSLSGMEMACDYGNNIRTVARVPGPAFGHQIAYCNLLPRDQFPPFPPNQDHVTVE 
MSVRVNGRNIVKANFTIYDCSRTAQVYPHTACTSCLSAQWPCFWCSQQHSCVSNQSRC 
EASPNPTSPQDCPRTLLSPLAPVPTGGSQNILVPLANTAFFQGAALECSFGLEEIFEA 
VWVNESVVRCDQVVLHTTRKSQVFPLSLQLKGRPARFLDSPEPMTVMVYNCAMGSPDC 



SQCLGREDLGHLCMWSDGCRLRGPLQPMAGTCPAPEIRAIEPLSGPLDGGTLLTIRGR 
NLGRRLSDVAHGVWIGGVACEPLPDRYTVSEEIVCVTGPAPGPLSGVVTVNASKEGKS 
RDRFSYVLPLVHSLEPTMGPKAGGTRITIHGNDLHVGSELQVLVNDTDPCTELMRTDT 
SIACTMPEGALPAPVPVCVRFERRGCVHGNLTFWYMQNPVITAISPRRSPVSGGRTIT 
VAGERFHMVQNVSMAVHHIGREPTLCKVLNSTLITCPSPGALSNASAPVDFFINGRAY 
ADEVAVAEELLDPEEAQRGSRFRLDYLPNPQFSTAKREKWIKHHPGEPLTLVIHKEQD 
SLGLQSHEYRVKIGQVSCDIQIVSDRI IHCSVNESLGAAVGQLPITIQVGNFNQTIAT 
LQLGGSETAIIVSIVICSVLLLLSVVALFVFCTKSRRAERYWQKTLLQMEEMESQIRE 
EIRKGFAELQTDMTDLTKELNRSQGIPFLEYKHFVTRTFFPKCSSLYEERYVLPSQTL 
NSQGSSQAQETHPLLGEWKI PESCRPNMEEGI SLFSSLLNNKHFLIVFVHALEQQKDF 
AVRDRCSLASLLTIALHGKLEYYTSIMKELLVDLIDASAAKNPKLMLRRTESVVEKML 
TNWMSICMYSCLRETVGEPFFLLLCAIKQQINKGSIDAITGKARYTLNEEWLLRENIE 
AKPRNLNVSFQGCGMDSLSVRAMDTDTLTQVKEKILEAFCKNVPYSQWPRAEDVDLEW 
FASSTQSYILRDLDDTSVVEDGRKKLNTLAHYKIPEGASLAMSLIDKKDNTLGRVKDL 
DTEKYFHLVLPTDELAEPKKSHRQSHRKKVLPEIYLTRLLSTKGTLQKFLDDLFKAIL 
SIREDKPPLAVKYFFDFLEEQAEKRGISDPDTLHIWKTNSLPLRFWVNILKNPQFVFD 
IDKTDHIDACLSVIAQAFIDACSISDLQLGKDSPTNKLLYAKEIPEYRKIVQRYYKQI 
QDMTPLSEQEMNAHLAEESRKYQNEFNTNVAMAEIYKYAKRYRPQIMAALEANPTARR 
TQLQHKFEQVVALMEDNI YECYSEA" 
polyA_signal 7032. .7037 

/gene="PLXNDl" 
/not e= "putative" 

ORIGIN 



Query Match 67.3%; Score 4157.8; DB 5; Length 7095; 

Best Local Similarity 81.7%; Pred. No. 0; 

Matches 5023; Conservative 0; Mismatches 1002; Indels 124; Gaps 14; 



Qy 


1 


CCAGCATGCTCAACGTGGCCGCCAACCACCCCAACGCGTCCACTGTGGGACTGGTGCTGC 

1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II II 1 1 1 1 
CCAGCATGCTGAACGTGGCGGCCAACCACCCGAACGCGTCCACCGTGGGGCTAGTTCTGC 


60 


Db 


710 


769 


Qy 


61 


CGCCTACCTCGGGCACCGGGGGCAGCCGTCTGCTCGTGGGCGCCACGTACACCGGCTTCG 

1 II II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 II 1 1 II 
CTCCCGCCGCGGGCGCGGGGGGCAGCCGCCTGCTCGTGGGCGCCACGTACACCGGTTACG 


120 


Db 


770 


829 


Qy 


121 


GCAGCGCTTTCTTCCCGCGCAACCGTAGCCTAGAAGACCACCGCTTCGAGAACACGCCCG 

Mill 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 1 1 1 II 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 I 1 1 1 1 1 I 1 1 I 

GCAGCTCCTTCTTCCCGCGCAACCGCAGCCTGGAGGACCACCGCTTCGAGAACACGCCCG 


180 


Db 


830 


889 


Qy 

Db 


181 
890 


AGATCGCTATCCGCTCCCTGGACGCGCGTGGAGACTTGGCCAAGCTCTTCACCTTCGACC 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 II III 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGATCGCCATCCGCTCCCTGGACACGCGCGGCGACCTGGCCAAGCTCTTCACCTTCGACC 


240 
949 


Qy 


241 


T T AAC C C G T C GG AC GAT AAC AT C C T G AAG AT C AAG C AG G G C G C C AAGG AG C AG C AC AAG C 

1 1 1 1 1 1 II Mill 1 11 II M 1 1 1 1 1 M 1 II 1 II 1 1 1 II 1 M 1 1 1 1 II 1 1 1 II 1 M 
TCAACCCCTCCGACGACAACATCCTCAAGATCAAGCAGGGCGCCAAGGAGCAGCACAAGC 


300 


Db 


950 


1009 


Qy 


301 


TGGGCTTCGTGCGTGCCTTCTTGCACCCGGCGGTGCCACCGCACAGCGCGCAGCCCTACG 

1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 II 1 1 II 1 1 1 II 1 1 II 
TGGGCTTCGTGAGCGCCTTCCTGCACCCGTCCGACCCGCCGCCGGGTGCACAGTCCTACG 


360 


Db 


1010 


1069 


Qy 


361 


CGTACCTGGCGCTCAACAGCGAGGCGCGTGCGGGCGACAAGGACAGCCAGGCGCGCAGCC 

1 1 1 1 II 1 1 II 1 1 1 1 1 1 1 1 II M M II M 1 1 1 II 1 1 1 II 1 II 1 1 II 1 1 II II II II II 

CGTACCTGGCGCTCAACAGCGAGGCGCGCGCGGGCGACAAGGAGAGCCAGGCGCGGAGCC 


420 


Db 


1070 


1129 



Qy 4 21 TGCTGGCGCGCATCTGCCTGCCCCGCGGCGCGGGTGGCGACGCCAAGAAGCTCACCGAGT 4 80 



Db 197 0 CCAGCCGCTGTCCTGCCATGACCGTCCTGCCTTCCGAGATCGATGTGCGCCAGGAGTACC 2029 

Qy 1321 CAGGTATGATCTTACAGATCTCAGGAAGCCTGCCCAGCCTCAGCGGCATGGAGATGGCTT 1380 

I I I I I I I I I I 1 I I I I I I I I I l I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2030 CAGGCATGATCCTGCAGATCTCGGGCAGCCTGCCCAGCCTCAGTGGCATGGAGATGGCCT 208 9 

Qy 1381 GTGACTATGGAAATGGCGTTCGAACGGTGGCCCGGGTACCTGGCCCTGCCTATGATCATC 1440 

I I I I I II II I II I I II II I I I I I I I I M II I I I I I I I I II II I I I I 

Db 2090 GTGACTATGGGAACAACATCCGCACTGTGGCTCGGGTCCCAGGCCCTGCCTTTGGTCACC 214 9 

Qy 14 41 AGATTGCCTACTGCAATCTCCTGCCCAGGGCCCAGTTTCCATCCTTTCCTGCTGGCCAGG 1500 

I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 

Db 2150 AGATTGCCTACTGCAACCTCCTGCCGAGGGACCAGTTTCCGCCCTTCCCCCCCAACCAGG 2209 

Qy 1501 ACCACGTGACCGTTGAGATGTCTGTAAGGGTCAAAGGACACAACATTGTCTCAGCCAATT 1560 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I 

Db 2210 ACCACGTGACTGTTGAGATGTCTGTGAGGGTCAATGGGCGGAACATCGTCAAGGCCAATT 2269 

Qy 1561 TCACCATCTACGACTGCAGCCGAATTGGACAAGTCTACCCCCATACAGCCTGTACCAGCT 1620 

I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I I I I II I I I I I I I I I I I I I I I II 

Db 2270 TCACCATCTACGACTGCAGCCGCACTGCACAAGTGTACCCCCACACAGCCTGTACCAGCT 2329 

Qy 1621 GCCTGTCCACACAGTGGCCTTGCTCCTGGTGCATCCAGCTGCATTCATGTGTCTCCAACC 1680 

I II I I I I I I I II I I I I I II I I I I I I I I I I I I I I I I I II I I I I I I I I I I II 

Db 2330 GCCTGTCGGCACAGTGGCCCTGTTTCTGGTGCAGCCAGCAGCACTCCTGTGTTTCCAACC 238 9 

Qy 1681 AGTCTCAGTGCCAGGACTCGCCAAACCCCACGAGTCCTCAGGACTGTCCCCAGATCCTGC 17 4 0 

MINI I I I I III III I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I 

Db 2390 AGTCTCGGTGCGAGGCCTCACCAAACCCCACGAGCCCTCAGGACTGCCCCCGGACCCTGC 24 4 9 

Qy 1741 CCTCGCCCCTAGCGCCCGTGCCCACAGGTGGCTCCCAAGACATCCTGGTGCCCCTGACTA 1800 

I I I I II II I I I I I I I I I I II I I I I I I I I I I I I I I I I II I I I I I I I I I I I 

Db 2 4 50 TCTCACCCCTGGCACCCGTGCCTACGGGTGGCTCCCAGAACATCCTGGTGCCTCTGGCCA 2509 

Qy 18 01 AAGCCACCTTCTTCCATGGTTCCTCCCTCGAGTGCAGCTTTGGGCTGGAAGAGAGCTTTG 18 60 

I I I I I I I I I I I I I I I I I II I I II I II I I I I I I I I I I I I I I I I I I I 

Db 2510 ACACTGCCTTTTTCCAGGGTGCAGCCCTGGAGTGTAGTTTTGGGCTGGAGGAGATCTTCG 25 69 

Qy 18 61 AGGCTGTATGGGCGAATAACTCACTGGTCCGCTGCAACCAAGTGGTGCTGCACACAACCC 1920 

I I I I I I I I I I I I II I I II I II I I I I I I I I I I I I I I I I I I I I I I I I 1 I I 

Db 2 57 0 AGGCTGTGTGGGTGAATGAGTCTGTTGTACGCTGTGACCAGGTGGTGCTGCACACGACCC 2 62 9 

Qy 1921 AGAAGAGCCAGGTATTTCCACTGAGTCTGAAGCTGAAGGGGCCGCCAGACCGATTCCTAG 1980 

I I I I I I I I I I II II II II II II I II I I I I I I I I I I I I I I I I I I I I I I 

Db 2 630 GGAAGAGCCAGGTGTTCCCGCTCAGCCTCCAACTAAAGGGGCGGCCAGCCCGATTCCTGG 2 68 9 

Qy 1981 ACAGCCCTAACCCCATGACAGTTGTGGTCTACAACTGTGCTATGGGCAGCCCTGACTGTT 204 0 

MINIM I I I I I I I I I M I I I I I I I I II I I I I I I I M I I I I I II I I I II I II 

Db 2 690 ACAGCCCTGAGCCCATGACAGTCATGGTCTATAACTGTGCCATGGGCAGCCCCGACTGTT 274 9 

Qy 2041 CCCAGTGCCTGGGCCGTGAGGACCTGGGTCACCTCTGTGTTTGGAATGATGGCTGTCGTC 2100 

I I I I I I I I I I I I I I I I II I I I I I I II I II I I I II I I I M I I I II I I II II I 

Db 27 50 CCCAGTGCCTGGGCCGCGAAGACCTGGGTCACCTGTGCATGTGGAGTGATGGCTGCCGCC 2809 

Qy 2101 TAAGAGGGCCCCTGCAGCCACTCCCTGGCACCTGCCCAGCCCCTGAAATCCGAGCGATTG 2160 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I M M I I I I I I M II 
Db 2810 TGCGGGGGCCTCTGCAGCCCATGGCTGGCACCTGCCCCGCCCCCGAGATCCGCGCGATTG 2869 



1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

Db 1130 TGCTGGCGCGCATCTGCCTGCCCCACGGCGCCGGCGGCGACGCCAAGAAGCTCACCGAGT 1189 

Qy 481 CCTACATCCAACTGGGCTTGCAGTGCGCGGGCGGCGCGGGCCGCGGCGACCTCTACAGCC 54 0 

I I I I I II II I I I I I II I I II I I I I I I I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I 

Db 1190 CCTACATCCAGTTGGGCTTGCAGTGCGCGGGCGGCGCGGGCCGCGGCGACCTCTACAGCC 124 9 

Qy 541 GCCTCGTGTCGGTTTTCCCCGCGCGCGAGCAGTTCTTCGCCGTCTTCGAGCGGCCCCAGG 600 

MM M M II II I II I I M II I II I I I I I I II I II I I I I I II II I II II I I 

Db 1250 GCCTGGTGTCGGTCTTCCCAGCCCGGGAGCGGCTCTTTGCTGTCTTCGAGCGGCCCCAGG 1309 

Qy 601 GCGCCCCAGGTGCCCGCAACGCCCCGGCCGCGCTTTGCGCCTTCCGCTTCGACGACGTGC 660 

I I II I I I I II M II I I I I I I I I II I I I I I II I I I I I II I I II I I I II I 

Db 1310 GGTCCCCCGCGGCCCGCGCTGCTCCGGCCGCACTCTGCGCCTTCCGCTTCGCCGACGTGC 1369 

Qy 661 AGGCTGCCATTCGTGCAGCGCGCACCGCCTGCTTCGTGGAGCCGGCGCCCGACGTGGTGG 720 

II II II I II II I I II I I II II M I I I II I I I I II I I II II I I II I I I I I I I I I 

Db 1370 GAGCCGCCATCCGAGCTGCGCGCACCGCCTGCTTCGTGGAACCGGCGCCCGACGTGGTGG 1429 

Qy 721 CGGTGTTGGACAGTGTGGTGCAGGGCACCGGGCCGGCCTGCGAGAGCAAGCGCAACATAC 780 

I I I I I I I I I I I I I I I I II I II I I I I II I I II I I I I I I I I I II I I I II I I I I I 

Db 14 30 CGGTGCTCGACAGCGTGGTGCAGGGCACGGGACCGGCCTGCGAGCGCAAGCTCAACATCC 14 8 9 

Qy 781 AGCTGCAGCCGGAGCAACTGGATTGCGGAGCGGCCCACCTGCAGCACCCACTGACCATCC 84 0 

I I I I II I I I I I I I I I I I II II II II I II II II I I I I I II I I I Ml II I I I I 

Db 14 90 AGCTCCAGCCAGAGCAGCTGGACTGTGGAGCTGCTCACCTGCAGCACCCGCTGTCCATCC 154 9 

Qy 841 TGCAGCCGCTGAGGGCATCGCCCGTGTTCCGTGCTCCAGGGCTCACGGCCGTGGCTGTGG 900 

I I I I I I I I I II III I I I I I I II II I I I II II II I II II I I I I I II MM 

Db 1550 TGCAGCCCCTGAAGGCCACGCCCGTGTTCCGCGCCCCGGGCCTCACCTCCGTGGCCGTGG 1609 

Qy 901 CCAGTGCCAACAACTACACGGCCGTCTTTCTGGGCACCGCCACAGGGAGGCTCCTCAAGA 960 

M II I I I I I I M I I I I I I I I II II I I I I I M I I I I I I I M I I I I II II II 

Db 1610 CCAGCGTCAACAACTACACAGCGGTCTTCCTGGGCACGGTCAACGGGAGGCTTCTCAAGA 1669 

Qy 961 TCAGCCTGAACGAGAGCATGCAGGTAGTAAGCAGGCGAGTGCTGACTGTAGCCTATGGGG 1020 

Ml M M I I I II I M II I I I I I I I II II I I I I II III I II II II I I II II I I I I 

Db 1670 TCAACCTGAACGAGAGCATGCAGGTGGTGAGCAGGCGGGTGGTGACTGTGGCCTATGGGG 1729 

Qy 1021 AGCCTGTGCATCACGTCATGCAGTTTGACCCCATGGATCCTGGTTACCTATACCTGATGA 1080 

I I II I I II I I I II I II I I I I I I I II II I II I I I I I I I I I I I I I I I I I I 

Db 1730 AGCCCGTGCACCATGTCATGCAGTTTGACCCAGCAGACTCCGTTTACCTTTACCTGATGA 178 9 

Qy 1081 CATCCCACCAGATGGCCCGAGTGAAGGTGGCAGCGTGTGAGGTACACTCCACCTGCGGGG 114 0 

I I I I I I I I I I II II II I I I I I I I I I II I I M I II I I II II II I II II I I 

Db 17 90 CGTCCCACCAGATGGCCAGGGTGAAGGTCGCCGCCTGCAACGTGCACTCCACCTGTGGGG 184 9 

Qy 1141 ACTGCGTGGGTGCGGCCGATGCCTACTGTGGTTGGTGCACTCTGGAGACCCGGTGCACAC 1200 

I I I I I I I I I I I I M I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I II 

Db 1850 ACTGCGTGGGTGCGGCGGACGCCTACTGCGGCTGGTGTGCCCTGGAGACGCGGTGCACCT 1909 

Qy 1201 TCCAGCAGGATTGCACCAACTCCAGCCAGCCACATTTCTGGACCAGTGCCAGTGAGGGCC 1260 

I I I I II II I I I I I I I I I II II II I I I I I I I I M I I I I I I II I I II M M I M I I 

Db 1910 TGCAGCAGGACTGCACCAATTCCAGCCAGCAGCATTTCTGGACCAGTGCCAGCGAGGGCC 1969 



Qy 



12 61 CCAGCCGCTGCCCTGCCATGACAGT ACTGCCCTCGGAGATTGATGTGCACCGGGACTACA 1320 
I I I I I I I I I I I I II II II II I II II I II M I I II I I II II I I II III III 



2161 AGCCTCTGAGTGGCCCCTTGGACGGTGGGACTTTGCTGACCATCCGTGGCAGGAACTTGG 2220 

Mil I I I I I I I I I I I I I I I I I I I I I I I I I I I I II I I I I I I I I II I I I I I I III 
2870 AGCCCCTGAGTGGCCCGTTGGACGGTGGGACCCTGCTGACCATCCGAGGAAGGAACCTGG 2 92 9 

2221 GCCGTCGGCTCAGTGATGTGGCACATGGTGTGTGGATTGGCAGTGTGGCCTGTGAACCCC 2280 

I I I I II I I I I I I I I I I I I I I II II I I I I I I I I I I I I I I I I I I I I I I I I II I 
2930 GCCGGCGGCTCAGTGACGTGGCCCACGGCGTGTGGATTGGTGGTGTGGCCTGTGAGCCAC 298 9 

2281 TGGCTGACAGATACACCGTTTCAGAGGAGATCGTGTGTGCCACAGGGCCTGCCGCAGGGG 234 0 

II I I I I I II I I I I I I II II I I I I I I II I I I I I I I I I I I I I I I I I III I I I I 

2 990 TGCCTGACAGATACACGGTGTCGGAGGAGATCGTGTGTGTCACAGGGCCAGCCCCAGGAC 304 9 

2341 CCTTCTCAGACGTGGTAACGGTGAACGTCTCCAAGGAAGGCAGGTCTCGGGAACAGTTCT 24 00 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I l-l I I I I I I I 
3050 CACTCTCAGGTGTGGTGACCGTGAACGCCTCT AAGGAGGGCAAGTCCCGGGACCGCTTCT 3109 

24 01 CCTATGTGCTGCCCACGGTCCACTCACTGGAGCCTTCCATGGGCCCAAAGGCCGGGGGTA 24 60 

I I II I I I I I I I I I MINIMI I II I II I I I I II I I I II II I I I II II I I I I I 
3110 CCTACGTGCTGCCCCTGGTCCACTCCCTGGAGCCTACCATGGGCCCCAAGGCCGGGGGCA 3169 

24 61 CAAGGATCACCATTCACGGCAGTGACCTCAACGTGGGCTCTATGCTCCAGGTCCTGGTGA 2520 

I I I I I I I I I I I I II II I I I I I I I I I M I I I I I I I I I II I II M I I I I I I 
317 0 CCAGGATCACCATCCATGGGAATGACCTCCATGTAGGCTCCGAGCTCCAGGTCCTGGTGA 3229 

2521 ATGACACGGACCCCTGCACAGATCTTACGCGCACAGCCACCAGCATCACCTGCACTGTGC 2580 

I II I I I I I I II I I I I I I II M I II I I I I II II I I II II I I II I II I Ml 
3230 ACGACACAGACCCCTGCACGGAGCTGATGCGCACAGATACCAGCATCGCCTGCACCATGC 3289 

2581 CAGGGGGTACCCTGCCCTCTCCAGTGCCTGTGTGTGTGCGCTTCGAGAGCCGGGGCTGCG 2 64 0 

I I III I II II II I I II I I I I II I II I I I II I I I II I I I I I I II II I I I I II 

32 90 CTGAGGGGGCCCTGCCGGCTCCGGTGCCTGTGTGTGTGCGCTTCGAGCGTCGGGGCTGCG 334 9 

2641 TGCACGGAAACCTCACCTTCTGGTACATGCAGAACCCAGTCATCACAGCCATCAGCCCAG 2700 

II II I II II II II II II II II II II I II I I II I I I I I I II II I I II M II II II 

3350 TGCACGGCAACCTCACCTTCTGGTACATGCAGAACCCGGTCATCACGGCCATCAGTCCCC 34 09 

27 01 GCCGCAGCCCTGTCAGTGGCGGCAGGACCATCACTGTGGCTGGCGAACGCTTCCACATGG 27 60 

I I I II I I I I I II I II II II I I I I I I II I! I I I II II II II II II II II II I I I I II 
3410 GCCGCAGCCCTGTCAGTGGCGGCAGGACCATCACAGTGGCTGGTGAGCGTTTCCACATGG 34 69 

2 7 61 TGCAGAATGTATCAATGGCTGTACACCACATTGGCCGGGAGCCCACGTTCTGCAAGGTTC 282 0 

II II II II I I II I II II II I I I I I I I II II I I I I I II M I I I I M M I I M II II 

3 4 7 0 TGCAGAATGTGTCCATGGCCGTCCACCACATTGGCCGGGAGCCCACGCTCTGCAAGGTTC 352 9 

2821 TCAACTCCACACTCATCACCTGCCCATCTCCTGGAGCCCTGAGCAATGCTTCGGCGCCTG 2880 

II II II I II I II I I II I I II II II II II II I I II I I I II II II II II I II I 
3530 TCAACTCCACCCTCATCACCTGCCCGTCCCCCGGGGCCCTGAGCAACGCATCAGCGCCAG 358 9 

2881 TAGACTTCTTCATCAATGGCCGGGCATATGCAGACGAG GCAGCCGAGGAGCTGC 2 934 

I I II I II I I I II II I II I II I II I I II I I II I I I I I I I II II I M I 

3590 TGGACTTCTTCATCAATGGGCGGGCCTACGCAGACGAGGTGGCTGTGGCTGAGGAGCTAC 364 9 

2935 TGGACCCTGCAGAGGCACAGAGGGGCAGCCGGTTCCGCCTAGACTACCTCCCCAACCCAC 2 994 

I M M M I II I I I I II I II II M II I I II II I I II I II II II I II II I I I I I II 
3650 TGGACCCCGAGGAGGCACAGCGGGGCAGCAGGTTCCGCCTGGACTACCTCCCCAACCCAC 3709 



Qy 


2995 


Db 


3710 


Qy 
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Db 


3770 


Qy 


3115 


Db 


3830 


Qy 


3175 


Db 


3890 


Qy 


3235 


Db 


3950 


Qy 


3295 


Db 


4010 


Qy 


3355 


Db 


4070 


Qy 


3415 


Db 


4130 


Qy 


3475 


Db 


4190 


Qy 


3535 


Db 


4250 


Qy 


3595 


Db 


4310 


Qy 


3655 


Db 


4370 


Qy 


3715 


Db 


4430 


Qy 


3775 


Db 


4490 


Qy 


3835 



AGTTCTCCACAGCCAAGAGGGAGAAGTGGATCAAACATCACCCAGGAGAGCCGCTCACCC 3054 

I I I I I I I II I I I I I I I I I I I I I I I I I I I I I I I II I I I I I II I I I I I I I I I M I 
AGTTCTCTACGGCCAAGAGGGAGAAGTGGATCAAGCACCACCCCGGGGAGCCTCTCACCC 37 69 

TCGTCATCCATAAGGAGCAAGACAGCCTGGGGCTGGAGAGCCATGAGTACCACATCAAGA 3114 

I I I I I I I I I I II I I I I I I I I I I I M II II II I I I I II MINI I I I I I I 
TCGTTATCCACAAGGAGCAGGACAGCCTGGGGCTCCAGAGTCACGAGTACCGGGTCAAGA 3829 

TTGGCCAGGTGTCCTGCGACATCCAGATCATCTCAGACAGAGTCATCCACTGCTCAGTCA 317 4 

I II I II I I II I M I I I I I I I I I II I III I I I I II I I I I I I I I M I 

TAGGCCAAGTAAGCTGCGACATCCAGATTGTCTCTGACAGAATCATCCACTGCTCGGTCA 388 9 
ATGAGTCGCTGGGCACGGCTGAAGGACAGCTGCCCATCACAATCCAGGTGGGGAACTTCA 3234 

I Mill I I I I I I MM I II I I I I I I I I II II I I I I I I I I I I I I II II II I II 

ACGAGTCCCTGGGCGCGGCCGTGGGGCAGCTGCCCATCACAATCCAGGTAGGGAACTTCA 394 9 

ACCAGACCATCGCCACACTGCAACTGGGGGGCAGCGAGACGGCCATTGTGGTGTCCATCG 3294 

II I II I I I I M II I I I I I I I II I II I I I I I I M I I I I I II I II 

ACCAGACCATCGCCACACTGCAGCTGGGGGGTAGCGAGACGGCCATCATCGTGTCCATCG 4 00 9 

TCATCTGCAGTGTCCTGTTGCTGCTGTCTGTGGTTGCTCTGTTCGTCTTCTGCACCAAGA 3354 

II I I I I I I I I I I I I I I I I I I II I II I I I II II I I I I II II M I I I I I II II M 
TCATCTGCAGCGTCCTGCTGCTGCTCTCCGTGGTGGCCCTGTTCGTCTTCTGTACCAAGA 4 069 

GCCGCCGTGCCGAGCGCTACTGGCAGAAGACCCTGCTGCAGATGGAAGAGATGGAGTCTC 3414 

I I I I Mill Mill I I I I II II I I II I I M I I I II I I M I I I I I II II I I MM 
GCCGACGTGCTGAGCGTTACTGGCAGAAGACGCTGCTGCAGATGGAGGAGATGGAATCTC 412 9 

AGATCCGAGAGGAGATCCGTAAAGGCTTTGCGGAGCTGCAGACAGACATGACGGATCTCA 34 7 4 

I I I I II I i I I I II I I I I I I I I II I M II I I I II I I II I I I I I M II II II II M I 
AGATCCGAGAGGAAATCCGCAAAGGCTTCGCTGAGCTGCAGACAGACATGACAGATCTCA 418 9 

CCAAGGAGCTGAACCGCAGCCAGGGCATCCCCTTCTTGGAGTACAAGCACTTCGTGACTC 3534 

I I I I I I I I I I II I II M II I I II I I I I Ml II I I I II I I I I II II II II M I 

CCAAGGAGCTGAACCGCAGCCAGGGCATCCCCTTCCTGGAGTATAAGCACTTCGTGACCC 424 9 

GAACCTTCTTCCCCAAGTGCTCTTCCCTCTATGAAGAGCGGTATGTGCTGCCCTCGAAGA 3594 

I I I I I I I I I I I II II II I II II I II II I I I I II II I II I I I M I II II I III 
GCACCTTCTTCCCCAAGTGTTCCTCCCTTTATGAAGAGCGTTACGTGCTGCCCTCCCAGA 4 309 

CCCTCAACTCCCAGGGTGGCTCCCCGCCACAGGAAACCCACCCACTGCTGGGAGAGTGGA 3654 

I II I I II II II I I I I I I I M M I II I I I II II I II II M II I I II I II II I II II I 
CCCTCAACTCCCAGGGCAGCTCCCAGGCACAGGAAACCCACCCACTGCTGGGAGAGTGGA 4 369 

ACATCCCTGAACACTGTCGGCCCAGCATGGAGGAGGGGATCAGCCTGTTCTCCTCACTGC 3714 

I II II Mill Mill M III I I I II I I I I 

AGATTCCTGAGAGCTGCCGGCCCAACATGGAAGAGGGAATTAGCTTGTTCTCCTCACTAC 4 4 29 

TCAACAACAAGCACTTCCTCATCGTCTTCGTCCATGCTCTGGAGCAGCAGAAGGACTTCG 37 7 4 

I I I I I I I II II II I I I I II I II II M M Mill II II I I II I M II I II I II I II I 
TCAACAACAAGCACTTCCTCATCGTCTTTGTCCACGCGCTGGAGCAGCAGAAGGACTTTG 4 4 8 9 

CAGTGCGTGACAGGTGCAGCCTGGCGTCCCTGCTGACCATCGCGCTGCACGGCAAGCTGG 3834 

I I II I I II II I I I II II II I I I I II I I M I I I I II I I M M I I II II I I II I I I II 
CGGTGCGCGACAGGTGCAGCCTGGCCTCGCTGCTGACCATCGCGCTGCACGGCAAGCTGG 4 54 9 

3835 AGTACTATACGAGCATCATGAAGGAGCTGCTCGTGGACCTCATCGACGCCTCGGCGGCCA 38 94 



Db 


4 550 


Qy 


3895 


Db 
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Qy 


3955 


Db 


4 o / (J 
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4 015 


Db 


4730 


Qy 


4075 
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Db 


4 910 
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Db 
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4315 


Db 


5030 


Qy 


4375 


Db 


5090 


Qy 
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Db 


5150 


Qy 


4495 


Db 


5210 


Qy 


4555 


Db 


5270 


Qy 


4615 


Db 
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Qy 


4 675 



1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 

AGTACTACACCAGCATCATGAAGGAGCTGCTGGTGGACCTCATTGACGCCTCGGCCGCCA 4 609 

AGAACCCCAAGCTCATGTTGCGGCGCACGGAGTCTGTGGTGGAGAAGATGCTTACCAACT 3954 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I MUM I 

AGAACCCCAAGCTCATGCTGCGGCGCACAGAGTCTGTGGTGGAGAAGATGCTCACCAACT 4 669 

GGATGTCCATCTGCATGTACGGCTGCCTGAGGGAGACAGTAGGTGAGCCGTTCTTCCTGC 4 014 

I I I I M I M I I I I I I I M M MM III M II I I I II II I I I I I I M M M M I 
GGATGTCCATCTGCATGTACAGCTGTCTGCGGGAGACGGTGGGGGAGCCATTCTTCCTGC 4 72 9 

TGTTGTGTGCCATCAAGCAGCAGATCAACAAAGGCTCCATCGACGCCATCACAGGCAAAG 4 07 4 

I I I I II I I I M I II I I II I I I I I M I M I I I M I I M I I I M M M I II I II II I I 
TGCTGTGTGCCATCAAGCAGCAAATCAACAAGGGCTCCATCGACGCCATCACAGGCAAGG 4 7 8 9 

CCCGCTACACACTCAACGAGGAGTGGCTGCTGAGGGAGAACATTGAGGCCAAGCCCCGGA 4134 

M M M M M I II I II I M M M II II I I II I I I M I I I M II II II M M M M M 
CCCGCTACACACTCAATGAGGAGTGGCTGCTGCGGGAGAACATCGAGGCCAAGCCCCGGA 4 8 4 9 

ACTTGAACGTGTCCTTCCAGGGCTGTGGGATGGACTCCCTCAGCGTGCGGGCCATGGACA 4194 

I I II I I I I I II M M M I I II I I I M I I I I I I I I I II I I I II II II M M M M M 
ACCTGAACGTGTCCTTCCAGGGCTGTGGCATGGACTCGCTGAGCGTGCGGGCCATGGACA 4 909 

CCGACACGCTGACGCAGGTGAAGGAGAAGATCCTGGAAGCCTTCTGCAAGAACGTCCCCT 4 254 

II I I I I I M M II I I II I I I I II I I II I I II I I M I II I I II I II II I I II I I II 
CCGACACGCTGACACAGGTCAAGGAGAAGATCCTGGAGGCCTTCTGCAAGAATGTGCCCT 4 969 

ACTCACAGTGGCCGCGGGCGGAGGACGTGGACCTTGAATGGTTTGCCTCGAGTACCCAGA 4 314 

I I I I I II I II I I I I I II I I I I I I II I I II II I I Mill I II I I II II I I I I 
ACTCCCAGTGGCCGCGTGCAGAGGACGTCGACCTTGAGTGGTTCGCCTCCAGCACACAGA 502 9 

GCTACGTCCTCCGGGACCTGGATGACACATCAGTGGTGGAGGACGGCCGTAAGAAACTGA 4 37 4 

II I I I I I I I I II II I I I II I I I I I I II I II I M I I I II II II I I I II I I II I 
GCTACATCCTTCGGGACCTGGACGACACCTCAGTGGTGGAAGACGGCCGCAAGAAGCTTA 508 9 

ACACACTGGCCCACTACAAGATACCTGAGGGCGCCTCCCTAGCCATGAGCCTCACAGACA 4 4 34 

MM I I I I II I I I I I I II 11 I I I I I II I M II I I I I II I II II I I I I I I I II 
ACACGCTGGCCCATTACAAGATCCCTGAAGGTGCCTCCCTGGCCATGAGTCTCATAGACA 514 9 

AGAAGGACAGTACCCTGGGCAGAGTGAAAGACTTGGACACAGAAAAGTATTTCCATTTGG 4 4 94 

II II I I I I I II II II I I II I I II II II I II II II II I II M I I I I I II M I I II I 

AG AAG G AC AAC AC AC T GG GC C GAG T G AAAG AC T T G G AC AC AG AG AAG TAT T T C CAT T T GG 5209 

TGCTACCTACGGATGAGCTGGTAGAGCCTAAGAAATCTCACCGGCAGAGCCACCGCAAGA 4 554 

I I I I II II I I II I I I I II I II II I Mill I M I I II II I I I II I II I II I I I I 

TGCTGCCTACGGACGAGCTGGCGGAGCCCAAGAAGTCTCACCGGCAGAGCCATCGCAAGA 52 69 

AAGTATTGCCAGAGATCTACCTGACCCGCCTGCTGTCCACCAAGGGCACGCTGCAGAAGT 4 614 

I II I II II II I I I M II II I I I I I I I II I I II I I II I I II I I I I I II I I I I I 
AGGTGCTCCCGGAAATCTACCTGACCCGCCTGCTCTCCACCAAGGGCACGTTGCAGAAGT 5329 

TCCTAGATGACCTGTTCAAGGCTATCCTGAGCATCCGAGAGGACAAGCCCCCGCTGGCTG 4 67 4 

I II I II II M I I II I I II I I II I II II I I II I II II II II I II II II I I I I I 
TTCTGGATGACCTGTTCAAGGCCATTCTGAGTATCCGTGAAGACAAGCCCCCACTGGCTG 538 9 

TCAAGTATTTCTTTGACTTCCTAGAGGAACAGGCGGAGAAGAGAGGCATCTCCGACCCTG 4 734 
II II I I I I II I I I M I II M II I II II I I I 



Db 5390 TCAAGTACTTTTTCGACTTCCTGGAGGAGCAGGCTGAGAAGAGGGGAATCTCCGACCCCG 54 4 9 

Qy 4 7 35 ACACCCTGCATATCTGGAAGACCAACAGCCTTCCCCTGCGCTTCTGGGTGAACATCTTAA 4 7 94 

MM M M M M M M M I M M M M M I I M M M M M M M M I I I 

Db 54 50 ACACCCTACACATCTGGAAGACCAACAGCCTTCCTCTCCGGTTCTGGGTGAACATCCTGA 5509 

Qy 4795 AAAATCCCCAGTTTGTCTTCGACATAGAGAAGACGGACCACATCGACGCCTGCCTGTCTG 4854 

I M M M M M I M M I M M I M M M I M M I M M M M M I M I I M I 

Db 5510 AGAACCCCCAGTTTGTCTTTGACATCGACAAGACAGACCACATCGACGCCTGCCTTTCAG 5569 

Qy 4 8 55 TCATCGCACAGGCCTTCATCGATGCCTGCTCCATCTCTGACCTGCAGCTGGGCAAGGACT 4 914 

I I I I I II I I I I I M I I I I I I I I I I I I I II I I I II I I I M II I I I I M I I I I I I I I I I 

Db 557 0 TCATCGCGCAGGCCTTCATCGACGCCTGCTCCATCTCTGACCTGCAGCTGGGCAAGGATT 562 9 

Qy 4 915 CACCCACCAACAAGCTTCTGTACGCGAAGGAGATCCCTGAGTACCGGAAGACCGTACAGC 4 974 

j II I II II I I I I I I II Mill I I I I I I I I I I II I I I I I I I I I II I III I I I I 

Db 5630 CGCCAACCAACAAGCTCCTCTACGCCAAGGAGATTCCTGAGTACCGGAAGATCGTGCAGC 568 9 

Qy 4 975 GCTATTATAAACAGATCCAAGACATGACGCCGCTCAGCGAGCAGGAAATGAACGCACACC 5034 

MM II II I II I I I I I II II II II I I I I II I I I I II I I I II I I I II II II I 

Db 5690 GCTACTACAAGCAGATCCAGGACATGACGCCGCTCAGCGAGCAAGAGATGAATGCCCATC 574 9 

Qy 5035 TGGCCGAGGAGTCTCGGAAATACCAGAATGAGTTCAACACAAACGTGGCCATGGCTGAGA 5094 

II I I I II II II I I I II I I I II II II II II II II II I II II II I II II II II II I I 

Db 57 50 TGGCCGAGGAGTCGAGGAAATACCAGAATGAGTTCAACACCAATGTGGCCATGGCAGAGA 5809 

Qy 50 95 TTTATAAATATGCTAAGAGGTATCGACCACAGATCATGGCTGCCCTGGAGGCCAACCCCA 5154 

II II II I II II II II II I I II I II II II II II II I M II M II II II II II II 

Db 5810 TTTATAAGTACGCCAAGAGGTATCGGCCGCAGATCATGGCCGCGCTGGAGGCCAACCCCA 58 69 

Qy 5155 CAGCCCGCAGGACCCAGCTACAGCACAAGTTTGAACAGGTGGTGGCTCTGATGGAAAACA 5214 

I I I I I I I I II I M II I I I I II I I I II II I II I II II I II II II II II I III 

Db 58 70 CGGCCCGGAGGACACAACTGCAGCACAAGTTTGAGCAGGTGGTGGCTTTGATGGAGGACA 592 9 

Qy 5215 ATATCTATGAGTGTTACAGCGAGGCCTGATGCAGAAGAGTGACCAGGAGCTTCGGCCAGG 527 4 

I II I II I I I I I II II I M II I I I I I I I I I I II I I I I I 

Db 5930 ACATCTACGAGTGCTACAGTGAGGCCTGAGACACATG-GAGAGTTGGTCAGGCTGCTGCT 5988 

Qy 527 5 GAGACGGCGTGCAGGCCACTTGGCCTCCACTTGGT-TTCTTCCCCACATCTCTCACTTGG 5333 

III I I I I II I I II I I I I II II I II II I II I II I II I I 

Db 5989 GGGAGAAATGGACGCCCACTGGGCCTCAACTTGATCTTCTACCCCGTGCCTGTGACTCAG 6048 

Qy 5334 GCTGGGAACTGACAGAGGAGCCTGCTGGGCTAGGAGTGGGGGACACTGGCCTCTTAGTGC 5393 

I II II I I I I II II I 11 I II I II II 

Db 604 9 ACTGGGAAATACTGAGCAGAGACGGCTGGGGCGGGGGCAGGAGGAGGGGCTGCTCTCTG- 6107 

Qy 53 94 CCGGCTGCCGAGCTCTTGGCCTTGTCCCCTGGGGCATCTCTGTCCCCTCCACCTGCCCAA 54 53 

III I II I III III II III I II I I I M I I 

Db 6108 -AGACAGGGGCGCCCCCGCCTTGACCCCTGGGCACCTCCATCCCCTCCCACCTGTCCCCA 6166 

Qy 54 54 GACCCAACTCTAGGATGAAGGCCTTGAATATCGATCG CTGCCAGTCCCTA 5503 

Ml II II II II I II I II III I M II I II 

Db 6167 GATCAGTCTCTGGGATGGAGGCCAGAGAGCTGGTCAGGCTCCCCCATCTGCCCAGCACGG 622 6 

Qy 5504 ATAAGACTTTCCCTGCCAACCAGGACAGCCTGGACCATGCCTGCCTGTTCACTGT 5558 

III I II II II I I I I I II II II I I 

Db 6227 CCTGCACTGTGCCCACCCACTTGCTCCACAACGTCCAGTTGGTCCTGCTGCCAAGAGCCC 62 8 6 



Qy 5559 TTCAGGCTGCTCAGCACACATTGGGAGAG GTGGCCAT 5595 

I I I I I I I I I I I I I I I I I II I I I I I I I 

Db 6287 CGTGCATCCAGGCGGCCAAGCACAAACTGGGGGAGAGGAGGCCGCCAGCCCGGAGGCTGC 634 6 

Qy 5596 ATCCCAG-AACACTACCTCATCCACCTGGCAGAGGGAA 5632 

I I I I I I I I I I I I I I I I I I I I I I I I Ml 
Db 634 7 AGCCCAGAAACTCTACCTCATCCACACTGGTGCAGGGAGCCCTCCTTGAACTGACCTTTG 64 06 

Qy 5633 TTTCTGCTTCAGCCACCAAGCAGTTGTCT GTGTCCCTCATCCAGAGGGGGC 5683 

I I I I I I I I I I I I II I I I III Ml I I I II I I II I I I I 

Db 64 07 ATTGGTTTCTGCTTCAACTACCAAAATGTTATCTCCACTTCCCCCTCACCCGTAGAGGAT 64 66 

Qy 5684 CTTGGCCACCAACAGTTCCAAACCAGGTCAGCTGTTAGCCGTCTCATTGGCCAGTGGCAG 5743 

I I I I I I I I I I I I I I I I I I I I I I I II I I I III I I I I I 

Db 64 67 CCTGGCCACAGACAGTTTCAAGTAGTGTCAGATTTTTGTTGCTTGGGCGGCTGTTGGTAG 652 6 

Qy 57 4 4 CATGGGCAGTGCCCATTGC CCACAGAACGGTGGAGAGAGG 57 8 3 

I I I I I I I I I I M 

Db 6527 AGTGGGCAGTGCCCGCGCCATGGGGTGCTCTGTGGGCTTCTCCAGGAGCAGGGAGGGTGG 6586 

Qy 57 84 GGGACAGGCTGGGGG TTCCTGGCCCCAGGAAAGGGAGGAAGGCG 5827 

M III I I I I II I I I I I I I I II I I I I I I 

Db 6587 AGGGGAGGGATGGGGGGCACAGGAGCTGGGAGCCCCGTCTCCAGGAAAAGGAGAGGGGTT 664 6 

Qy 582 8 AGGATGCAG GGCTGTAGCTGGACTACTCAGTCTTCCTGGAAGTGTTTCTAAAGAGCA 5884 

I MINI MINIMI!! I I I I I I I I I II M I 

Db 664 7 AAGATGCACCGAGGCTGTAGCTGGGCTACTTGATCTTGCTGAAAGTGTTTCTAAAGATAG 67 06 

Qy 58 8 5 CCACTTTTTTTTGTTTTTTGTTTTTTAAGATVAAAAAAAACTTTTATATATTAAAACAAAA 594 4 

I I I I I I II I I I I I I I I II I I I I II I I I I I II M 
Db 67 07 CACCACTTTTTTTTTTAAAGCTTTTATATATTAAAAAACGTATCATGCACCAACTGTGAA 67 66 

Qy 5945 ACTTATGCACCAACTGTGAATAGCTGCCGCTTGTGCAGATCCCCAGGGGCTCCCGGTGAC 6004 

II II I I I I I I I I I I I I I I I I E II I I I 

Db 67 67 TAGCTGCCGCT TGCGCAGAGGACCCGGGGAGGGGTCCCGAGAGGCTCCCCATGCA 68 21 

Qy 6005 ACACTGGAAATGACTGTTCCAGGGGACAG 6033 

I II I I I II I I I I I I I I I I I I I I I I I 
Db 6822 ACACTGGAAATGACTGTTCCAGAGAGCGG 6850 



